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Abstract 

In this paper, we introduce a natural class of multigraphs called hierarchical-scale-free 
(HSF) multigraphs, and consider constant-time testability on the class. We show that a very 
wide subclass, specifically, that in which the power-law exponent is greater than two, of HSF 
is hyperfinite. Based on this result, an algorithm for a deterministic partitioning oracle can 
be constructed. We conclude by showing that every property is constant-time testable on the 
above subclass of HSF. This algorithm utilizes findings by Newman and Sohler of STOC’ll. 
However, their algorithm is based on the bounded-degree model, while it is known that actual 
scale-free networks usually include hubs, which have a very large degree. HSF is based on 
scale-free properties and includes such hubs. This is the first universal result of constant-time 
testability on the general graph model, and it has the potential to be applicable on a very 
wide range of scale-free networks. 
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1 Introduction 


How to handle big data is a very important issue in computer science. In the theoretical 
area, developing efficient algorithms for handling big data is an urgent task. For this purpose, 
sublinear-time algorithms look like they could be powerful tools, as they are able to read very 
small parts (constant size) of inputs. 

Property testing is the most well-studied area in sublinear-time algorithms. A testing algo¬ 
rithm (or a tester) for a property accepts an input if it has the stipulated property and rejects it 
if it is far away from having the stipulated property with a high probability (e.g., at least 2/3) 
by reading a constant part of the input. A property is said to be testable if there is a tester [9], 

Property testing of graph properties has been well studied and many fruitful results have 
been obtained ©Mina mm Bang. Testers on the graphs are separated into three groups 
according to model: the dense-graph model (the adjacent-matrix model), the bounded-degree 
model, and the general model. The dense-graph model is the best clarified: In this model, 
the characteristics of testable properties have been obtained [2j. However, graphs based on 
actual networks are usually sparse and unfortunately the dense-graph model has been found 
not to work. Studies on the bounded-degree model have been proceeding recently. One of the 
most important findings for this model is that every minor-closed property is testable [3]. This 
result can be extended to the surprising result that every property of a hyperhnite graph is 
testable m- However, graphs based on actual models have no degree bounds, i.e., it is known 
that web-graphs have hubs [I, 16], which have a large degree, and, unfortunately once again, 
these algorithms do not work for them. 

Typical big-data graph models are scale-free networks, which are characterized by the power- 
law degree distribution. Many models have been proposed for scale-free networks ESEHHEl 
□HUH ED E21E3U2U- Recently, a promising model based on another property of a hierarchical 
isomorphic structure has been presented: If we look at a graph in a broad perspective, we find a 
similar structure to local structures. Shigezumi, Uno, and Watanabe [22] presented a model that 
is based on the idea of the hierarchical isomorphic structure of power-law distribution of isolated 
cliques. An idea of isolated cliques was given by Ito and Iwama mm, and the definition is 
as follows. For a nonnegative integer c > 0, a c-isolated clique is a clique such that the number 
of outgoing edges (edges between the clique and the other vertices) is less than ck , where k is 
the number of vertices of the clique. A 1-isolated clique is sometimes simply called an isolated 
clique. 

Based on the model of [22], we introduce a class of multigraphs, hierarchical scale-free multi¬ 
graphs (HSF, Definitions ofl. which represents natural scale-free networks. We show the 
following result (Theorem [T79]) : 

Every property is testable on HSF if the power-law exponents are greater than two. 

Given this result, many problems on actual scale-free big networks will prove to be solvable 
in constant time. Although this result is an application of the algorithms of [20], which is a 
result on bounded-degree graphs, HSF is not a class of bounded-degree graphs. This is the first 
result on universal algorithms for the general graph model. 

1.1 Definitions 

In this paper, we consider undirected multigraphs without self-loops. We simply call this type 
of multigraph a “graph” in this paper and use G = (V, E) to denote it, where V is the vertex set 

1 In a preliminary version of this paper, m, the definition of HSF is different. The definition in this paper is 
far more general (wider) than in the preliminary version. 
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and E is the edge (multi)set. Sometimes V and E are denoted by V[G\ and E[G\, respectively. 
Henceforth, we use “set” to refer to a multiset for notational simplicity. Throughout this paper, 
n is used to denote the number of vertices of a graph, i.e., \V\ = n. 

For a graph G = (V, E) and vertex subsets X. Y C V. Eg(X, Y ) denotes the edge set between 
X and Y, i.e., Eq(X,Y ) = {(x,y) £ E \ x £ X, y £ Y}. Eq{X,V\X) is also simply written as 
Eg(X). \Eq(X)\ is denoted by dc{X). For a vertex v £ V, the number of edges incident to v 
is called the degree of v. A singleton set {x} is often written as x for notational simplicity. E.g., 
the degree of v is represented by dciy). The subscript G in the above Eq(*), etc., may 

be omitted if it is clear. 

For a vertex v £ V, Tg(v) denotes the set of vertices adjacent to v, i.e., Tc/t) := {tt £ 
V | (v,u) £ E}. Note that |Fg(u)| may not be equal to dciy) as parallel edges may exist. 
For a graph G = (V,E) and a vertex subset X C V, the subgraph induced by X is defined as 
G(X) = (X,{(u,v) £ E | u,v £ X}). 

For a vertex subset X C V, a contraction of X is defined as an operation to (i) replace X 
with a new vertex vx, (ii) replace each edge (v,u) in E(X) (v £ X, u £ V\X) with a new edge 
(vx,u), and (iii) remove all edges between vertices in X. That is, by contracting X C V, a 
graph G = (V, E) is changed to G' = (V 7 , E') such that 

V' = V\X U {v x }: and 

E' = E\{(v, u ) \ v £ X, u £ H} U {{vx, u) \ (v, u) £ E, v £ X, u £ V — X}. 

We identify the above ( vx,u ) £ E' with (v,u) £ E. In other words, we say that (v,u) remains 
in G' (as (vx , it)). Note that the graphs are multigraphs, and thus if there are two edges 
(v,u), (v',u) £ E for v,v' £ X, v / v' and u £ H\X, then two parallel edges, both represented 
by (vx- u), one of which corresponds to (v,u) and the other of which corresponds to (v',u), are 
added to E' . Also note that none of the graphs considered in this paper contain self-loops, and 
hence an edge (v,v') £ E with v, v' £ X is removed by contracting X. 

Two graphs G\ = (V\,E\) and G 2 = (V 2 , E 2 ) are isomorphic if there is a one-to-one cor¬ 
respondence : V\ —> V 2 such that Eq 1 {u,v ) = Eg 2 {$(,u),$(v)) for all u, v £ V\. A graph 
property (or property, for short) is a (possibly infinite) family of graphs, which is closed under 
isomorphism. 

Definitions 1.1 (e-far and e-close) Let G = (V,E) and G' = (' V',E') be two graphs with 
|V| = \V'\ = n vertices. Let m(G,G') be the number of edges that need to be deleted and/or 
inserted from G in order to make it isomorphic to G. The distance between G and G 1 is 
defined aJl dist(G, G') = m(G,G')/n. We say that G and G' are e-far */dist(G, G) > e; 
otherwise e-close. Let P be a non-empty property. The distance between G and P is dist(G, P) = 
mhiG"£P dist(G, G"). Otherwise we say that G is e-far from P ifdist(G,P) > e, and e-close. 

Definitions 1.2 (testers) A testing algorithm for a property P is an algorithm that, given 
query access to a graph G, accepts every graph from P with a probability of at least 2/3, and 
rejects every graph that is e-far from P with probability at least 2/3. Oracles in the general graph 
model are: for any vertex v, the algorithm may ask for the degree d{v), and may ask for the ith 
neighbor of the vertex (for 1 < i < d(v W0 The number of queries made by an algorithm to the 
given oracle is called the query complexity of the algorithm. If the query complexity of a testing 

2 The distance defined here may be larger than 1 as m(G, G') > n may occur. (In the bounded-degree model 
it is defined as dist(G, G') = m(G,G')/dn.) However, here we consider sparse graphs and they have an implicit 
upper bound of the average (not possibly maximum) degree, say d, and thus dist(G, G') is bounded by d. 

3 Although asking whether there is an edge between any two vertices is also allowed in the general graph model, 
the algorithms we use in this paper do not need to use this query. 
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algorithm is a constant, independent of n (but it may depend on e ), then the algorithm is called 
a tester. A (graph) property is testable if there is a tester for the property. 

Definitions 1.3 (isolated cliques j!4j ) For a graph G = (V,E) and a real number c > 0, a 
vertex subset Q C V is called a c-isolated clique if Q is a clique (i.e., (u,v) € E, for all u,v £ Q 
and u v) and dc(Q) < c\Q\. A 1-isolated clique is sometimes called an isolated clique. £(G) 
is the graph obtained from G by contracting all isolated cliques. Two distinct isolated cliques 
never overlap, except in the special case of double-isolated-cliques, which consists of two isolated 
cliques with size k sharing k — 1 vertices. A double-isolated-clique Q has no edge between Q 
and the other part of the graph (i.e., dc(Q) = 0), and thus we specially define that a double- 
isolated-clique in G is contracted into a vertex in £{G). Under this assumption, £(G) is uniquely 
defined. 

Definitions 1.4 (hyperfinite [7]) For real numbers t > 0 and e > 0, a graph G = (V, E) 
consisting of n vertices is (t, e)-hyperfinite if one can remove at most en edges from G and 
obtain a graph whose connected components have size at most t. For the function p : R + —> R + , 
G is p-hyperfinite if it is (p(e),e) -hyperfinite for all e > 0. A family Q of graphs is p- hyperfinite 
if all G £ Q are p-hyperfinite. A family Q of graphs is hyperfinite if there exists a function p 
such that Q is p-hyperfinite. 

Hyperfinite is a large class, as it is known that any minor-closed property is hyperfinite in a 
bounded-degree model. From the viewpoint of testing, the importance of hyperfiniteness stems 
from the following result. 

Theorem 1.5 ( [201 ) For the bounded-degree model, any property is testable for any class of 
hyperfinite graphs. 

This result is very strong, but there is a problem in that the result works on bounded-degree 
graphs and it is natural to consider that actual scale-free networks do not have a degree bound. 

1.2 Our contribution and related work 

In this paper, we apply the universal algorithm of m to scale-free networks. We formalize two 
natural classes, SF and TLSF that represent scale-free network^. The latter is a subclass of 
the former. 

Definitions 1.6 For positive real numbers c > 1 and 7 > 1, a class of scale-free graphs (SF) 
SF(c,^y) consists of (multi)graphs G = (V. E) for which the following condition holds: 

(i) Let Vi be the number of vertices v with d(v) = i. Then: 

Vi < cnE 1 , Vz € {2 ,3,... , }. (1) 

The above property (i) is generally called a power-law and in many actual scale-free networks, 
it is said that 2 < 7 < 3 p]. That is, SF is a class of multigraphs that obey the power-law 
degree distribution. 

We show that this class is e-close to a bounded-degree class if 7 > 2 (Lemma 12.11) . 

4 T-LSF was introduced in the preliminary version of this paper [13]. However, the definition in this paper is 
more general (wider) than in the preliminary version. 
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After showing this property, we show the hyperfiniteness of the class. Hyperfiniteness seems 
to be closely related to a high clustering coefficient, where the cluster coefficient cl(G') of a graph 
G = (V,E) is defined a^|: 


cl(G) := - c 1 g(* 0 , cIg('c) : = 

n z ' 


vev 


\{(u,v) <E E | u,v € T G (v),u ^ u}| 
(l r oMI) 


Sometimes c1g(u) is called the local cluster coefficient of v . It is said that cl(G) is 0(1) for 
many classes that model actual social networks, while lintn^oo cl(G) = 0 for random graphs. 

These three characterizations, “high clustering coefficient,” “existence of isolated cliques,” 
and “hyperfiniteness” appear to be closely related to each other. In fact, it is readily observed 
that if c1g(u) = 1 for a bounded-degree graph G (the degree bound is d ), then G consists of only 
(completely) isolated cliques with size at most d + 1 , and G is (d+1, 0)-hyperhnite! 

Unfortunately, however, it is also observed that for any 0 < c < 1, there is a class of 
bounded-degree graphs G such that lin^^oo cl(G) = c and it is not ( t , e)-hyperfinite for any 
pair of constants t and e < 1/2, e.g., G = (V,E) consists of n/d cliques of size d, and random 
n /2 edges between vertices in different cliques (each vertex has d — 1 adjacent vertices in its 
clique and one adjacent vertex outside the clique). To separate this graph into constant-sized 
connected components, almost all of the edges between cliques (their number is en/ 2 ) must be 
removed. 

However, we do not need to give up here, as the above model does not look like a natural 
model of scale-free networks, e.g., by contracting each isolated clique, it becomes a mere random 
graph with n/d vertices. Prom this fact, the hierarchical structure of a high cluster coefficient 
looks important. The model presented by m has such a structure. Based on this model, we 
present the following class of multigraphs: 


Definitions 1.7 (Hierarchical Scale-Free Graphs) For positive real numbers c, 7 > 1 and 

a positive integer no > 1, a class of hierarchical scale-free graphs (HSF) 'HSJ 5 * 7 = FLSE{c, 7 , no) 
consists of (multi)graphs G = (V,E) for which the following conditions hold: 

( 1 ) G e SE(c, 7 ) 

(ii) Consider the infinite sequence of graphs Go = G, G\ = £{Gq), G 2 = £{G\), .... If 
|U[G/]| > no, then Gi includes at least one isolated clique Q C V with \Q\ > 2. (Note that 
if Gk has no such isolated clique, then Gk = Gk+i = Gk +2 = ■■■■) 


We show the following results. 


Theorem 1.8 For any USE = HSE(c, 7 , no) with 7 > 2 and any real number e > 0, there is 
a real number = tnjUSE, e) such that USE is (froi, e)-hyperfinite. 

We give a global algorithm for obtaining the partition realizing the hyperfiniteness of The¬ 
orem Ol The algorithm is deterministic, i.e., if a graph and the parameter e are fixed, then 
the partition is also fixed. The algorithm can be easily revised to a local algorithm and we ob¬ 
tain a deterministic partitioning oracle to get the partition (Lamrna 13.21) . Note that all known 
partitioning algorithms are randomized algorithms. By using this partitioning oracle and an 
argument similar to one used in m, we get the following main theorem. 

Theorem 1.9 Any property is testable for HSE(c, 7 , no) with 7 > 2. 

5 There is another way to define the cluster coefficient: 3x (# of cycles of length three)/(# of paths of length two). 

Although these two values are different generally, they are close under the assumption of the power-law degree 

distribution. 
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Related work As stated earlier, for the bounded-degree model, Newman and Sohler [20] 
presented a universal tester (which can test any property) for hyperfinite graphs. In the general 
graph model, there exist far fewer results than for the bounded-degree graph model and the 
dense graph model. For universal-type sublinear-time algorithms, Kusumoto and Yoshida m 
gave a testing algorithm with plylog(n) query complexity for forest-isomorphism. In the same 
paper, they showed an fl(logn) lower bound for this problem. 

This paper gives a universal tester that can test every property on a natural class of scale-free 
multigraphs in constant time. This is the first result for universal constant-time algorithms on 
sparse and degree-unbounded graphs. 

2 Hyperfiniteness and a Global Partitioning Algorithm 

2.1 Degree bounding 

For a graph G and a nonnegative integer d > 0, G\d is a graph made by deleting all edges 
incident to each vertex v with d(v) > d from G. Note that G\d is a bounded-degree graph with 
degree bound d. 

Lemma 2.1 For any SF = SF(c, 7 ) with 7 > 2, and any positive real number e > 0, there is 
a constant fen = fen(e, c, 7 ) such that for any graph G € SF, G |fefl is e-close to G. 

Before showing a proof of this lemma, we introduce some definitions. Riemann zeta function 
is defined by ("( 7 ) = This function is known to converge to a constant (£( 7 ) < 

1 + (7 — lfe 1 ) for any 7 > 1. We introduce a generalization of this function by using a positive 
integer k > 1 as ((k, 7 ) = Note that £( 7 ) = £( 1 , 7 ). 

Lemma 2.2 For any e > 0 and 7 > 1, there is an integer A |p = fem(e, 7 ) > 1 such that 
C(%3 7) < e- 

Proof: It is clear from the above fact that £( 7 ) converges for every 7 > 1. □ 

Proof of Lemma \2.1\ Let d be an arbitrary positive integer. Let be the number of removed 
edges to make G\d from G. From ([!]), 

OO OO 

md = ^2 ivi < ^2 cni~ < '' y ~ 1 ' 1 = cnf(d + 1,7 — 1 ). 
i=d-\-1 i=d-\-1 

From the assumption of 7 > 2 and Lemma [2721 + 1,7 — 1) < e/c if d + 1 > ferfL/c ,7 - !)• 

Thus by letting 5 = fep(e, c, 7 ) = fe^i (e/c, 7 — 1) — 1, we have < en. □ 

From here, we denote the above fepfe, c, 7 ) by 6 for notational simplicity. 

2.2 Hierarchical contraction, structure tree, and coloring 

Let W 1 ,...,W k (Wi C V, Mi £ {1,...,L}) be a family of subsets of vertices satisfying that 
Wi PI Wj = 0 for every i,j G (1,..., k} and i j, and W\ U • • • U W k = V. Then {W \,..., W k } is 
called a partition of V. Below, we explain a global algorithm for obtaining a partition of V real¬ 
izing the hyperfiniteness of a graph in 7-LSF with 7 > 2 , i.e., | Wj \ is bounded by a constant and 
the number of edges between different Wi and Wj is, at most, en. First, we give a base algorithm. 
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procedure HierarchicalContraction(G) 

begin 

1 i ■= 0, G 0 := G 

2 while there exists an isolated clique in G* = ( Vi,Ei ) do 

3 i := i + 1, Gj := £{Gi-1 ) 

4 enddo 
end. 


We denote Gj = ( Vi,Ei ) for i € {0,1,...}. Let G& = (14,-E*.) be the final graph of 
HierarchicalContraction(G). From the definitions of HSF, |14| < N. See Fig. |T] (a)- 
(c) for an example of applying this procedure. 



Figure 1: An example of HierarchicalContraction, the structure tree T, and the coloring: 
Here, we assume 5/e = 4.5; the number beside a vertex is w{*)\ the dotted circles are isolated 
cliques; colored areas are blue or yellow components. 

The trail of the contraction can be represented by a rooted tree T = (V[T], E[T]), which is 
called the structure tree of G, defined as follows. (Fig. (T] (d) shows an example of the structure 
tre^fl) 

V[T] := Vq U Vi U • • • U 14 U {r}, where r is the (artificial) root of T. Each v € Vo is a leaf 
of T, and a vertex v £ V (i £ {0,... , A;}) is on the level i of T, i.e., v £ Vi (i > 1) is the parent 
of u £ Vi -1 if “v is made by contracting a subset (clique or a double-isolated-clique) Q C V—i 
such that u G Q' or “v = u (i.e., u is not included in an isolated clique in Gj_i).” The root r 
is the parent of every vertex in 14. (The reason r is added is only to make T a tree.) 

6 In this example, we ignore 8 and the red vertices. Some may feel it curious that if 8/e — 4.5, then 8 « 4.5 
for small e, and hence many vertices in this example become red. This is correct. However, w(v) becomes larger, 
step by step, and thus if k is very large, then w(v) may be larger than 8/e , by only contracting vertices whose 
degree is at most J. 
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We introduce a function W : P[T]\{r} —> 2^ and coloring on the vertices in V[T\ as follows: 

• For v £ Vo: 

— W(y) = {u}, and 

— if d(v) > 5, then v is colored red, otherwise uncolored. 

• For v £ Vi (i = 1,..., k): 

— let S(v) be the set of uncolored children of v, 

— W(v) = U« e s(«)^(“)> and 

— if |W(w)| > 5/e, then v is colored blue, 

— else if v £ 14 and W(v) ^ 0 , then v is colored yellow, 

— otherwise, v is uncolored. 

Note that for any two distinct colored vertices u,v £ V[T], W(u) fl W(v) = 0 . For every 
v £ V[T], we also define a weight function as w(v) = |W(v)|. For a blue (resp. yellow) colored 
vertex v £ P[T], W(v) C V is called a blue (resp. yellow) component. 

By using these colors, we also color the edges in E (= Eq) in the following manner: 

• For every red vertex v £ Vo (= V), all edges in Eq(v ) are colored red. 

• For every blue component W C V, for every edge e £ Eq{W), if e is not colored red, then 
e is colored blue. 

• For every yellow component W C V, for every edge e £ Eq(W), if e is not colored either 
red or blue, then e is colored yellow. 

The other edges in E are uncolored. The set of red, blue, and yellow edges in E are repre¬ 
sented by R, B, and Y, respectively. These colors are preserved in G\ = £(G$), G 2 = £(G 1 ), 

..., G/. = £{Gk~ 1 ), e.g., if an edge e £ Ei is red, then the corresponding edge in Ei + \ is also 
red. 

2.3 Proof of Theorem 11.81 

Before showing the proof of Theorem 11.81 we prepare some lemmas. 

Lemma 2.3 For any Gi (i £ {0,..., k}), all edges incident to a vertex with a degree higher 
than 5 are red. 

Proof: For Go = G, the statement clearly holds from the coloring rule. Assume that the 
statement holds in G*_ 1 , and does not hold in some Gj. Let v be a vertex in V) such that 
dGi(v ) >5 + 1 and a non-red edge is incident to v. Then v must be made by contracting an 
isolated clique in G,_ 1 , say Q C V)_i, such that dc i _ 1 (Q ) >5+1. From the definition of isolated 
cliques, \Q\ > dc i _ 1 (Q) + 1 > 5 + 2. Since Q is a clique, every vertex Q has degree at least 
| Q| — 1 > 5 + 1 in Gj_i. It follows that all edges incident to a vertex in Q must be red. This 
contradicts the assumption that a non-red edge is incident to r. □ 

Lemma 2.4 \R\, \B\ < en, \Y\ < 5no/2. 
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Proof: |i?| < en is directly obtained from Lemma 12.11 Let v £ V) be a blue vertex such that a 
non-red edge exists in E(W{v)). From Lemma 12.31 d(W(v)) < 5. Thus d(W(v))/w(W(v )) < 
<5/(<5/e) = e. This means that the average number of blue edges per a vertex is less than e. 
Therefore \B\ < en. From Lemma 12.31 all edges incident to a vertex with degree higher than 5 
are red. From this it follows that the number of non-red edges in E^ is at most 5|I4|/2. Thus 
the number of yellow edges in E is also at most 5|I4|/2. By considering [Vy < no, we have 
\Y\ < dno/2. □ 

Let vf -, ..., vjf r be the red vertices ( k r is the number of red vertices). Let Wf, ..., be 
the blue components (k b is the number of blue components). Let W/ , .. W/. be the yellow 
components (k y is the number of yellow components). We consider a family of vertex subsets as 

V:= {{uf} | i = l,...,k r }U{W l B | i = l,...,k b }U{Wf | i = l,...,k y }. 

From the definition of the function W and the coloring. V is clearly a partition of V. 

Now we can prove Theorem 11.81 

Proof of Theorem. \1.8\ If n < 5no/(2e), then the statement is clear by setting t > 5no/(2e). 

Thus, we assume that n > 5no/(2e). Let G' be a graph obtained by deleting all red, blue, and 

yellow edges from G. From Lemma 12.41 the number of deleted edges is less than 

2 en + <5rio/2 < 3en. (2) 

Next, we will show that the maximum size of connected components in G' is at most 5(5+1)/e. 
Assume that there exists a connected component G'(X ) = (X,Ex) consisting of more than 
5(5 + l)/e vertices in G'. X includes no vertex v with dc(v) > 5, since from Lemma f2.3l all edges 
in Eq(v) are colored red. Moreover, there is no blue component W C V such that X fl W ^ 0 
and X\W / 0, as otherwise X would be disconnected in G' (by deleting blue edges). 

From this it follows that there is a blue or yellow component W including X. Let x £ V[T] 
be the (blue or yellow) vertex such that W = W(x). If x is a yellow vertex, then w(v) < 5/e (as 
otherwise v would be colored blue), and |A'| < w(v ) < 5/e < 5(5 + l)/e, which is a contradiction. 
Thus x must be a blue vertex. Assume that x £ V/. Let Y C I4_i be the set of children of x 
(in T). Y consists of an isolated clique or a double-isolated-clique in Gh-i■ For every vertex 
y £ y, dc h _ 1 (y) < 5 (from Lemma 12731) . Thus |T| < 5 + 1. 

Let Y' CL be the set of uncolored children of x. For v € Y'. w(v) < 5/e. Hence, 

w(x ) = ^ 1^1 ' <V e — 1^1 ' V e < (5 + l)5/e, 

veY' 

which is a contradiction. Therefore, the maximum size of connected components in G' is 5(5 + 
l)/e. 

Thus, we have proved that G is (max{5rao/(2e), 5(5 + 1)/e}, 3e)-hyperfinite. Here, e is an 
arbitrary real number in (0,1], then by defining frj~gi = max{35rao/(2e),35(5 + l)/e}, G is (fpl, e)- 
hyperfinite for any e > 0. □ 


3 Testing Algorithm 

3.1 Deterministic partitioning oracle 

The global partitioning algorithm of Theorem 11.81 can be easily revised to run locally, i.e., a 
“partitioning oracle” based on this algorithm can be obtained. A partitioning oracle, which 


calculates a partition realizing hyperfiniteness locally, was introduced by Benjamini, et al. [3] 
implicitly and by Hassidim, et al. [ 12 ] explicitly. It is a powerful tool for constructing constant¬ 
time algorithms for sparse graphs. It has been revised by some researchers and Levi and Ron’s 
algorithm [T8j is the fastest to date. All algorithms for partitioning oracles presented to date 
have been randomized algorithms. Our algorithm, however, does not use any random valuable 
and it runs deterministically. That is, we call it a deterministic partitioning oracle, which is 
rigorously defined as followqj: 

Definitions 3.1 O is a deterministic (f, e) -partitioning oracle for a class of graphs C, if, given 
query access to a graph G = (V,E), it provides query access to a partition V of G. For a query 
about v G V, O returns V(v). The partition has the following properties: (i) V is a function ofG, 
t, and e. (It does not depend on the order of queries to O.) (ii) For every v G V, l'P(u)! < t and 
V{y) induces a connected subgraph ofG. (in) IfG^C, then |{(u,u) G E \ V(u) ^ V(v)}\ < e\V\. 

Lemma 3.2 There is a deterministic (frOl e)-partitioning oracle Ohsf for HSF with 7 > 2 
with query complexity S °^ 2 / e+n °) for one query. 

Before giving a proof of this lemma, we introduce some notation as follows. A connected 
graph G = ( V, E) with a specified marked vertex v is called a rooted graph, and we sometimes 
say that G is rooted at v. A rooted graph G = ( V, E) has a radius t, if every vertex in V has 
a distance at most t from the root v. Two rooted graphs are isomorphic if there is a graph iso¬ 
morphism between these graphs that identifies the roots with each other. We denote by N ( d, t ) 
the number of all non-isomorphic rooted graphs with a maximum degree of d and a maximum 
radius of t. For a graph G = (V, E), integers d and t, and a vertex v G V, let Bq{v, d, t ) be the 
subgraph rooted at v that is induced by all vertices of G\d that are at distance t or less from 
v. Bc(v,d,t ) is called a ( d,t)-disk around v. From these definitions, the number of possible 
non-isomorphic (d, f)-disks is at most N(d,t). 

Proof of Lemma \3.2\ The global algorithm of Theorem 11.81 can be easily simulated locally. To 
find V(v), if d{v) > 6, then the algorithm outputs V(v) := {u}. Otherwise, if the algorithm 
finds a vertex u with d{u) > 5 in the process of the local search, u is ignored (the algorithm 
does not check the neighbors of u ). Thus, the algorithm behaves as on the bounded-degree 
model. For any vertex v, \P(v)\ < i pl = 0{5 2 /e). Each u G Bq(v,5,\ rgj) may be included 
in V(w) of w G Bg(u, 5, ffTsl)- Then, the algorithm checks most vertices in Bg(v , S, 2frj-gi) = 
Bg(v, 6, O(d 2 /e + no )), and thus the query complexity for one call of V(v) is at most ^°{ &2 N+ n 0 ). 

□ 


3.2 Abstract of the algorithm 

The method of constructing a testing algorithm based on the partitioning oracle of Lemma [3.21 is 
almost the same as the method used in [[20]. We use a distribution vector, which will be defined 
in Definition 13.31 of rooted subgraphs consisting of at most a constant number of vertices. 

Definitions 3.3 For a graph G = (V,E) and integers d and t, let disk c{d,t) be the distribution 
vector of all ( d,t)-disks of G, i.e., disk c(d,t) is a vector of dimension N(d,t). Each entry of 
disk c(d,t) corresponds to some fixed rooted graph H, and counts the number of ( d,t)-disks of 
G\d that are isomorphic to H. Note that G\d has n = \V\ different disks, thus the sum of 
entries in disk c{d,t) is n. Let iveq G (d,t) be the normalized distribution, namely freq G (d, t) = 
disk c{d,t)/n. 

' However, since Levi and Ron’s algorithm [18] looks fast, using it may be better in practice. 
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For a vector v = (vi,... ,v r ), its fanorm is ||w||i = YJi= 1 k*|- The 1 \-norm is also the length 
of the vector. We say that the two unit-length vectors v and u are e-close for e > 0 if ||n—rt||i < e. 

By using a discussion that is the same as in Theorem 3.1 in [20], the following lemma is 
proved. 

Lemma 3.4 There exist functions )\ 374 ] = (fa 4 ] = d^jTLFS, e), teH = t^jT-LFS, e), 

= JSfe-jf'HFS, e) such that for every e > 0 the following holds: For every G\ , G 2 £ TiFS on 
n > A fcn vertices, if |freq Gl (fat fell) - freq Ga (fef tell) | < > 101 , then G\ and G 2 are e-close. □ 

A sketch of the algorithm is as follows. Let G = (V, E) be a given graph and P be a property 

to test. First, we select some (constant) number t = i(e) of vertices Vi € V (i = 1,... ,£) and 

find V(vi) given by Theorem 11.81 This is done locally (shown by Lemma 13.21) . Consider a 

graph G’ := V{v\) U ••• U V{v(). Here, freq G (d, t) and freq G i(d,t) are very close with high 

provability. Next, we calculate min Gg p |freq G /(d, t) — freq G (d, t)\ approximately. There is a 

problem in that the number of graphs in P is generally infinite. However, to approximate it 

with a small error is adequate for our objective, and thus it is sufficient to compare G' with a 

constant number of vectors of freq(d, t). (Note that calculating such a set of frequency vectors 

requires much time. However, we can say that there exists such a set. This means that the 

existence of the algorithm is assured.) The algorithm accepts G if the approximate distance of 

min Gg p |freq G /(d, t) — freq G (d, t)\ is small enough, and otherwise it is rejected. 

The above algorithm is the same as the algorithm presented in [20] except for two points: 

in our model: (1) G is not a bounded-degree graph, and (2) G is a multigraph. However, these 

differences are trivial. For the first difference, it is enough to add an ignoring-large-degree-vertex 

process, i.e., if the algorithm find a vertex v having a degree larger than fa all edges incident 

to v are ignored. By adding this process, G is regarded as G |efan This modification does not 

effect the result by Lemma 12.11 For the second difference, the algorithm treats bounded-degree 

graphs as mentioned above, and the number of non-isomorphic multigraphs with n vertices and 

2 

degree upper bound (fan is finite (bounded by Q(tfa if )). 

Proof of Theorem \1.9\ Obtained from the above discussion. □ 

4 Summary and future work 

We presented a natural class of multigraphs TLSF representing scale-free networks, and we 
showed that a very wide subclass of it is hyperfmite (Theorem 11.81) . By using this result, the 
useful result that every property is testable on the class (Theorem 11.9ft is obtained. 

HSF is a class of multigraphs based on the hierarchical structure of isolated cliques. We 
may relax “isolated cliques” to “isolated dense subgraphs [14] ” and we may introduce a wider 
class. We consider such classes also to be hyperfinite. Finding such classes and proving their 
hyperfiniteness is important future work. 
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